AITopics | deep episodic value iteration

Collaborating Authors

deep episodic value iteration

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

[R] [1705.03562] Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning • r/MachineLearning

@machinelearnbotMay-11-2017, 02:30:05 GMT

One question though - why have you not directly try it on a standard RL like car pole or some of the Atari games etc... tbh the first time I hear about this Omniglot World task ( I know the dataset but never have seen it been using for RL)

deep episodic value iteration, machine learning, reinforcement learning, (3 more...)

@machinelearnbot

Industry: Leisure & Entertainment > Games > Computer Games (0.91)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)

Add feedback

Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning

Hansen, Steven Stenberg

arXiv.org Machine LearningMay-9-2017

We present a new deep meta reinforcement learner, which we call Deep Episodic Value Iteration (DEVI). DEVI uses a deep neural network to learn a similarity metric for a non-parametric model-based reinforcement learning algorithm. Our model is trained end-to-end via back-propagation. Despite being trained using the model-free Q-learning objective, we show that DEVI's model-based internal structure provides `one-shot' transfer to changes in reward and transition structure, even for tasks with very high-dimensional state spaces.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

arXiv.org Machine Learning

1705.03562

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback